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Abstract — The information spectrum approach gives general 
formulae for optimal rates of various information theoretic 
protocols, under minimal assumptions on the nature of the 
sources, channels and entanglement resources involved. This pa- 
per culminates in the derivation of the dense coding capacity for 
a noiseless quantum channel, assisted by arbitrary shared entan- 
glement, using this approach. We also review the currently known 
coding theorems, and their converses, for protocols such as data 
compression for arbitrary quantum sources and transmission of 
classical information through arbitrary quantum channels. In 
addition, we derive the optimal rate of data compression for a 
mixed source. 

Index Terms — Quantum information, dense coding capacity, 
quantum data compression, classical capacity, information spec- 
trum. 



I. INTRODUCTION 

QUANTUM information theory generalizes the ideas of 
coding and communication to include the nature of the 
physical system in which information is encoded. The informa- 
tion spectrum approach of Han & Verdú [2], [3] gives general 
formulae for many operational schemes in information theory. 
It replaces the idea of typical events (generally called typical 
sequences) in information theory, with high probability events. 
The power of this approach lies in the lack of assumptions 
about the source, channel and entanglement resource. 

The quantum information spectrum was defined in terms 
of quantum states by Hayashi & Nagaoka [6], initially in the 
context of hypothesis testing, and was used to determine a 
general expression for the classical capacity of arbitrary quan- 
tum channels. The quantum information spectrum extends the 
idea of high probability events to high probability subspaces 
of states in a Hilbert space. In the commutative case, the 
quantum information spectrum simply reduces to its classical 
counterpart. 

In this paper we present a review of coding theorems for 
quantum data compression and transmission of classical infor- 
mation through a quantum channel. The rate of compression 
for a mixed source is explicitly derived. A number of new 
results are also presented, including the dense coding capacity 
for a noiseless quantum channel, assisted by arbitrary shared 
entanglement. 
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II. Preliminaries 

Let B(Ti) denote the àlgebra of linear operators acting on a 
finite-dimensional Hilbert space TL of dimension d. The von 
Neumann entropy of a state p, i.e. a positive operator of unit 
trace in B(H), is defined as S(p) = — Trplogp. Throughout 
this paper, we choose the logarithm to base e. We could equally 
well choose an arbitrary base for the logarithm. This would 
simply scale the unit of information. 

A quantum channel is given by a completely positive trace- 
preserving (CPTP) map $ : B{JC) -> B(H), where K and H 
are the input and output Hilbert spaces of the channel. 

A. Spectral Projections 

The quantum information spectrum approach requires the 
extensive use of spectral operators. For a self-adjoint operator 
A written in its spectral decomposition A = we 
define the positive spectral projection on A as 

{A>Q}=J2\i)(i\ d) 

A,>0 

the projector onto the eigenspace of positive eigenvalues of A. 
Corresponding definitions apply for the other spectral projec- 
tions {A < 0}, {A > 0} and {A < 0}. For two operators A 
and B, we can then define {A > B} as {A — B > 0}, and 
similarly for the other ordering relations. 

B. Two Important Lemmas 

The following key lemmas are used repeatedly in the paper. 
For their proofs see [1]. 

Lemma 1: For self-adjoint operators A, B and any positive 
operator < P < I the inequality 

Tv[P(A - B)] < Tt[{A > B}(A - B)] (2) 

holds. 

Lemma 2: For self-adjoint operators A and B, and any 
completely positive trace-preserving (CPTP) map T the in- 
equality 

Tr[{T(A) > T(B)}T(A- B)] < Tr[{A > B}{A - B)} 

(3) 

holds. 

We also make use of the following proposition 
Proposition 1: Given a state p n and a self-adjoint operator 
u> n , we have 

Tr[{p n > e^avK] < e"^. (4) 

for any real 7. 
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Proof: We have 

Tr[{p„ > e n ^Lü n }{p n - e^u n ] > (5) 
and hence, by rearranging terms 

Tr[{p„ > e n ~<uj n }uj] < e-^Tc[{pn > e^ui n }p n ] < e~ n \ 

(6) 

where Tr[{p„ > e n ^uj n }p n ] < 1. ■ 

C. Quantum Spectral Information Rates 

As a generalization of the relative entropy, the spectral diver- 
gence allows information theory to include arbitrary sources 
and channels. 

Definition 1: Given the difference operator l·l n ( 7 ) = p n — 
e n7 uj n , the quantum spectral sup-(inf-)divergence rates are 
defined as 

D(p\\lü) = inf | 7 : limsu P Tr[{n n ( 7 ) > 0}n„( 7 )] = o} 

(7) 

72Go|M = sup{ 7 : liminfTr[{n„( 7 ) > 0}n„( 7 )] = l} 

(8) 

respectively. 

The spectral entropies, conditional spectral entropies, and 
spectral mutual information rates may all be expressed as a 
divegrence rate with appropriate substitutions for the sequence 
of operators u — {w„}™ =1 . These are 

S(p) = -RW) (9) 
S{p) = -D(p\I) (10) 

and for sequences of bipartite state p AB = {p AB }^ = \, 

S(A\B) = -D{p AB \I A ®p B ) (11) 

S(A\B) = -D{p AB \I A ® p B ) (12) 

S{A:B) = D(p AB \p A ®p B ) (13) 

S(A:B) = D{P AB \P A ® P B ), (14) 

giving all the spectral sup(inf)-information rates. Various prop- 
erties and relationships of these quantities are explored in [1]. 



III. Data Compression for Arbitrary Quantum 
Sources 

A general quantum source consists of a sequence of density 
P = {Pnj^Li acting on a corresponding sequence of Hilbert 
spaces H = {H n }n=i- 

A compression scheme for such a source, p, consists of 
two families of quantum operations C n and T> n . Here C n 
denotes the compression operation which takes states in the 
original Hilbert space fí n to states jin a Hilbert space fí n such 
that dim H n < dim H n . Hence, Ti n can be regarded as the 
compressed Hilbert space. The corresponding decompression 
operation, T> n , takes states in H n to states in the original 
Hilbert space 7í n . 

The compression scheme given by the family of combined 
compression decompression maps T> n oC n is said to be reliable 
if the entanglement fidelity F(p n ,T> n oC n ) tends to 1 as n — > 



oo. Let P n denote the orthogonal projection onto H n . The rate 
of the compression scheme is determined by 



R = lim sup — log M n , 

n — í-oo Tl 



(15) 



where M n := TrP„ = dim7í n . 

The objective is thus to obtain the optimal rate of reliable 
compression for a given source p. Defining the optimal rate 
1Z as the infimum of all reliable rates, leads to the following 
theorem. 

Theorem 1: The quantum spectral sup-entropy rate is opti- 
mal. Hence, 

Tl = S(p) (16) 

for a given source p. Equivalently, (i) if R > S(p) then there 
exists a reliable compression scheme of rate R, and (ii) there 
can be no reliable compression scheme of rate R for R < 
S(P). 

Proof: [Proof of (i) :] Suppose R > S(p). Consider the 
compression operation, C n , defined by its action on any state 
(T r i € B(fí n ) as follows: 



k 



(17) 



where (a) P n , the compression projection, i.e.^the orthogonal 
projection onto the compressed Hilbert space 7í„, is given by 



P„ 



{Pn > e-^In}, 



(18) 



and (ò) Ak := |xo) (k\, with |xo) being a fixed pure state in Tí n 
andjj k) } being an orthonormal basis for the orthocomplement 
of fí n . Equivalently, 



C n {<Jn) ■= PnVnPn + Tr((J„ - P n )<7„) |%o) (Xo| • 



(19) 



The corresponding decoding operation T> n is defined to be the 
identity on Tí n - 

If {Cl} and {-D^} denote finite sets of Kraus operators of 
the quantum operations C n and X>„ respectively, then 

F n := F{p n ,V n oC n ) = \^{ D ÍCÍp n )\ 2 - (20) 

jk 

and hence the entanglement fidelity is given by 

F(p n ,V n oC n ) = |Tr(P„p n )| 2 +^|Tr(A fe p„)| 2 



> \Tr(P nPn )\ 2 

> |Tr[P„(p n - 



-7T.7 



In)}\ 



- |Tr[{p„ > e- n T n }(p n - l n )] | 2 
= |Tr[{n„( 7 )>0}H„( 7 )]| 2 , 

(21) 

where II„( 7 ) = p n — e ni I n . From the definitions in (|9} and 
(|0 it follows that the RHS of tend s to 1 as n — * oo, for 

any 7 > S(p). 

Utilizing Proposition^ the dimension of the compression 
projections P n is bounded for each n by 



TrP n = Tr[{p„> e- n T/ n }] < 



,"7 



(22) 
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for 6 > 0. Since this is true for all 6 > we have 1Z < S(p). 

[Proof of (ii) (Weak Converse):] Suppose R < S(p). 
Without loss of generality, assume that C n maps jitates in 
Tí n to states in an Aí„-dimensional Hilbert space 7í n , with 
M n = [e nR \ . Hence, if P n is the orthogonal projection onto 
Un then Tr[P„] = M n < e nR . 

Let {C^} and {D k } denote finite sets of Kraus operators 
for the quantum operations C n and T> n respectively. Obviously, 
P n C^ = C^. Further, let Q^be the orthogonal projection 
onto the subspace to which 7í n is mapped to by D k . Then 
D*Cg = D k n P n Ct = Q k n D k n P n Ci = Q k D k Ct Moreover, 
Tr[QfJ < Tr[P n ] since V n is a CPTP map. 

The entanglement fidelity can be expressed as 

jk 

= J2\^(QnD k n CÍPn)\ 2 

jk 



= ^|Tr[(^C^)(V^Q^)]| 2 

jk 
jk 

<Tr[P nPn ] 

<Tr[{p n >e-^I n }(p n -e- n ^I n )] 
+ e - n ~<TrP n 



(23) 



(24) 



(25) 



To arrive at A23I . we have made use of the Cauchy Schwarz 
inequality for the Hilbert-Schmidt inner product, specifically 
|Tr(^tB)| 2 < Tr(AtA) • Tt(B^B). The inequality in (EU 
uses the inequality TrQ^ < TrP„, and the fact that C n and 
T> n are trace preserving maps. The final inequality in (125 \ 
follows from Lemman 

Using the fact that TrP„ < e nR , we have 

F n < Tr[{p n > e-^I n }(p n - e"^)] + e~ n ^- R \ (26) 

Choosing a number 7 and S > such that R = 7 + 5 < S(p), 
the second term on RHS of (1261 tends to zero as n — > 00. 
However, since 7 < S(p) the first term on RHS of (1261 does 
not converge to 1 as n — * 00. Hence, the fidelity does not 
converge to 1 in the limit as n — ► 00 and the compression 
scheme is not reliable. ■ 
The proof of the weak converse above shows that for 
R < S(p) the entanglement fidelity cannot approach unity, and 
hence any compression scheme will give an error with non- 
zero probability. To determine the rate at which the probability 
of error converges to 1 for any compression protocol we can 
equivalently determine the supremum of the rates for which 
the asymptotic limit of the entanglement fidelity goes to zero. 
Here we prové the result for the strong converse rate denoted 
by K*. 

Theorem 2: Coding a source p at a rate less than the quan- 
tum inf-spectral entropy rate gives an error with probability 
equal to one. That is 



R<S{p) = 
or, equivalently 1Z* — S_(p). 



lim F n = 0. 

n — >oo 



(27) 



Proof: From Í25Í we can immediately see that for rates 
R < S_(p) choosing a 7 = R + S < S_{p) we obtain 

lim F n = (28) 

n — >oo 

and the compression scheme fails with probability approaching 
1 as n — > 00. ■ 

A. Relationship to the von Neumann Entropy 

For any quantum information source p, the quantum spectral 
sup- and inf- information rates are related to the von Neumann 
entropy in the following manner. 

Lemma 3: The sup-information and inf-information rates 
are related to the von Neumann entropy by 

S{p) < lim inf -S{p n ) < lim sup -S(p n ) < S(p) (29) 

for any source p. 

Proof: Let {A^} denote the set of eigenvalues of state 
p n . For the first inequality we have 

-S(p n ) = --Tr[p„logp„] 
n n 1 J 

i 

>-- E K^èK 

X< i < e -«CS(í>)-fl 

> --Tr[{ Pn < e-^-^}p n ] log e -^)- 5 > 
n 

= (S(p) - S)Tt [{ Pn < e -»(^P)- 5 ) } Pn ] (30) 

and from the definition of S_(p) we have lim^oc Tr[{p„ < 
e -n(S(p)-5)jp^ _ ^ an( j tj^jg j s a - ue f or a u ,5 > 0, implying 



S_(p) < lim inf — S(p n ) 

n— >oc TL 



(31) 



Similarly, we have 

-S( Pn ) = -- ]T X n l0 &K 



\i > e -n(S(p) + 5) 

n z - i 

Aj i <e-"( S (P)+' 5 ) 

< (S( P ) + S)Tr[{p n > e- n ^ + V} Pn ] 
1 



-Tr[Q Pn log 



(32) 



where Q n {p n < e -«( s (p)+ 5 )}. Let W n := Q nPn Q n and 
define the normalized state W n := W n / {TïW n ). Hence, 

-S(pn) < (S(p) +5)Tv[{ Pn > e- n ^ + ^} Pn } 
n 

- ÍTrW n (log W n + logTrV7„ - logTrW„) 
= (S(p) + S)Ti[{ Pn > e^^jpn] 

- -TvW n S(W n ) - -H(TrW n ), 
n n 



< 



(S(p) +S)Tv[{ Pn > e-^^jpn] 

-logd n TrW„--íf(TrW„) 
n n 



(33) 



4 



SUBMITTED TO IEEE TRANSACTIONS ON INFORMATION THEORY 



In the above, H(-) denotes the Shannon entropy. and 
d n = dimft n . Since lim^oo TrW n = lim^^ Tr[{,9„ < 
e -n(S(p)+S)y p l _ tne j ast term yanishes in this limit. The 
second term also vanishes under the assumption that for all n 



1 



log d n < (3 



(34) 



for some j3 < +oo. Moreover, since \im n ->oo Tï[{Pn > 
e -n(S(p)+5)j. pn j = x> we have 



limsup-5(p„) < 5(p). 



(35) 



The remaining inequality follows from the definition of lim inf 
and limsup. ■ 



B. Mixed Sources 

Given two sources a — {(7,^^=1 an d w = { w n}^Li> we 
define the mixed source p = {p n }n°=i to t> e the source for 
which 



pn = ta n + (1 - t)w„ 



(36) 



for t G (0,1). 

Theorem 3: For the mixed source p the optimal rate 1Z is 
given by 

K = max[S(a),S(u)], (37) 

the maximum of the rates for either source a or ui. 

Proof: Let Tr[ll n ( 7 )] = Tr[{p„ > e-^I n }(p n - 
e~ ni I n )\, then from the linearity of the trace operation, we 
have 

Tr[n„( 7 )] = íTr[{p„ > e~ n ~<I n }(u> n - I n )\ 

+ (1 - í) Tr[{p„ > e-^I n }(a n ~ I n )} 

< tTr[{u n > e-"^„}K _ e -^I n )] 

+ (1 - t) Tr[{a„ > e- n ~*I n }(a n - e~^I n )] 

(38) 

where the inequality follows from LemmaQ Hence for any 
7 = S(p) + S, the limit of the LHS goes to one, and hence 
both of the traces on the RHS must also approach one in the 
limit. This then implies that 



S(p) > max\3(o),S(u)] 



(39) 



as S is arbitrary. 

To prové the reverse inequality we explicitly construct 
a sequence of projection operators. For each a > we 
utilize the projections P° := {er„ > e~ na I n } and Q n := 
{oj n > e~ na I n }, Let Q n have the spectral projection Q n = 
J2f=i K><*1, with K = TrQn- Starting with P°, we define a 
sequence of projection operators P n , i — 1, . . . , K, iteratively, 
as follows. For each i, if \i) lies in the subspace onto which 
P n ~ X projects, then we set = P n ~ X ■ Otherwise, we take 
the component of \i) orthogonal to this subspace, say \i ), 
andleti*[=i»*- 1 e|*· 1 ·)<i- L |. 



From Lemma 1 it then follows that 

Tr[n n ( 7 )] >Tr[P«(p n -e- n n n )] 
= íTr[P I fK-e-^/„)] 

+ (l-í)Tr[P n *K-e-^J„)] 
>tTr[{u n >e-" a í„K] 

+ (l-t)Tr[K>e- la I n K] 

-e-" 7 Tr[Pf ] 

>tTl[{Lü n >e- na In}Lün] 

+ (l-í)Tr[{ ( i„> e - nQ /„K] 

_ 2e -"('r- Q ) (40) 

where Tr[P^ ] < 2e na , as the rank of the projector cannot be 
greater than the sum of the ranks of the projectors P° and Q n - 
For every S > and a = max [ÏS(<t), S(ui)] + S, the limit of 
the sum of first two terms on the RHS goes to 1 . By choosing 
7 = a + S this implies both the RHS and LHS converge to 1 
and hence that 



S(p) < max [5(ct),5(. 



ui 



as S is arbitrary. ■ 
Corollary 1: The strong converse is given by 

TV = min [5(o"), 5(0;)] (41) 

for any mixed source p n = ta n + (1 — t)ui n , for t £ (0, 1). 

Proof: Choosing 7 and a such that the RHS and LHS 
of (1381 and j40> go to zero, respectively, gives the required 
inequalities. ■ 
A source obeys the strong converse property only if 



S(p) = lim -S(p n ) = S(p) 

n — >oo fl 



(42) 



Note that mixed sources do not obey the strong converse 
property if max \S(a), S(ui)\ > min \S.(cr) , S_(ui)\ . This can 
easily be shown to hold for mixtures of stationary memoryless 
sources with different entropies S(a) > S(ui). 

IV. Classical Capacity of an Arbitrary Quantum 
Channel 

In this section we obtain the classical capacity of a sequence 
of arbitrary quantum channels in terms of the inf-spectral 
mutual information rate of bipartite separable states shared 
through the channel. 

Let {/C^ n) }^ =1 and be two sequences of Hilbert 

spaces, and let A = {A^}^ x be a sequence of quantum 
channels such that, for each n, 



(n) 

Here iCq denotes the Hilbert space at the input of the channel 

A^, whereas TÍq denotes the Hilbert space at its output. 

Consider the following scenario. Suppose Alice has a set 
of messages, labelled by the elements of the set M = 
{1,2,..., M n }, which she would like to communicate to Bob, 
using the quantum channel A^. To do this, she encodes each 
message into a quantum state of a physical system with Hilbert 
space JCq and sends this state to Bob through the quantum 
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channel. In order to infer the message that Alice communi- 
cated to him, Bob makes a measurement (described by POVM 
elements) on the state that he receives. The encoding and 
decoding operations together define a quantum error correcting 
code (QECC). More precisely, a code C^ n > of size M n is 
given by a sequence {pl l ,EÍ l } n ^ í where each p % n is a state 
in 0(/Cq ) and each E l n is a positive operator acting in H < q\ 
such that J2fí\ El < I n . Defining E° n = I n - E* n , 
yields a resolution of identity in 7Íq í ' > . Hence, {E^} 1 ^ defines 
a POVM. An output i > 1 would lead to the inference that the 
state p l n was transmitted through the channel A%, whereas the 
output is interpreted as a failure of any inference. In other 
words, a code C^ n ' is given by a triple (M„, <p n , E n ), where 
(f> n is the encoder, i.e., 4> n (i) = p l n for i 6 {1,2, . . . , 2 nR }, 
and E n = {-E^J^Í™ is the decoder. The rate of the code is 
given by (l/n)logM n . The average probability of error for 
such a code C^ n ' is given by 

* Mn 

Pe{C (n) ) := w J2 (1 - M<K)) , (43) 

i— 1 

a % n being the output of the channel when the input is the i th 
codeword p l n . A quantity R E R is said to be an achievable 
rate if there exists an JV e N such that for all n > N, there 
èxits a sequence of codes {C < ™- ) }^L 1 with M n > e nR , and 
P e (C (n) ) -> as oo. 

The capacity of A is defined as 

C(A):=supiï, (44) 

where R is an achievable rate. 

Theorem 4: The classical capacity of a sequence of chan- 
nels A = {AQ}^ =1 is given by 

C(A) = max : AQ) (45) 

where (i) S is the set of sequences of separable states in 
B(Haq), with Haq being a sequence of Hilbert spaces 
H A q := {H { 2 ] ® /cf}£° =1 , and (m) 5(A : AQ) is the inf- 
spectral mutual information rate of a sequence of separable 
density matrices {p AA ®}^Li- 

Consider an arbitrary set X^ of indices and define a 
separable state 

ín) (n) 

acting in a Hilbert space 7i A ® /Cq . The set of codewords 
that Alice uses, to transmit her messages to Bob, is a finite 
subset of the set 

{p% : x e *<*>}. 
The state can be purified to the state 

pí a ' q --= E pi\*)W aa '®pIx 

in B{H { 2 ] ® H { A ) ® K, ( q ] ). Let B denote the bipartite system 

(n) (n) 

with Hilbert space 7í A £g> rt^/ (and thus replace the super- 
script AA' by i?). Let Q denote system with Hilbert space 



K>q>. A state p B ® of the form given by (146 \ is referred to 
as a classical-quantum state 1 . If A is a random variable with 
probability mass function {p^ : x S X^ n '}, then the state of 
the quantum system Q is correlated with the vàlues taken by 
the classical index X. The state p B< * therefore represents the 
preparation of quantum states p® x corresponding to classical 
indices x G X^ n \ according to the apriori distribution {p^}. 

The action of the channel on the system Q yields the 
state 

:= (id fl ®A«)(^«) 

= Y, P X n\x){A B ®^{pl X ) 

:= E PSI«X*I B ®^- (46) 

Here the superscript AQ is used to denote the system Q after 
the action of the channel on it. 

For the sequence of classical-quantum states {p BA ®} the 
inf-spectral mutual information rate is given by 

S(B : AQ) = su P { 7 : Hm Tr[{n„( 7 ) > 0}n„( 7 )] = l} 

(47) 

where n„( 7 ) := p BA Q - p B ® pW, and p B , p^ are 
the reduced density matrices of the systems B and AQ 
respectively. 

The proof of the Theorem |4]relies on the following lemma 
proved in [6]. 

Lemma 4: For any n 6 N, M € N, and 7 6 R, given 
a probability distribution {p*} on there exists a code 

CW of size \C^\ = M, whose average probability of error 
satisfies the following bound: 

P e (C (n) ) < 2 £ KTr[{^-e">^<0}p^] 

+4e-" 7 M, (48) 

where 

rïi ^7 ^ lnrn,x 

xEXi") 

Proof of Theorem We shall first prové that for any rate 
< R < S(B : AQ), the average probability of error P e (C(") 
vanishes asymptotically. Here S(B : AQ) denotes the inf- 
spectral mutual information rate for a sequence of classical- 
quantum states {pn A ®}%Li an d is given by (147 \ . 

Computing the reduced density matrices of the bipartite 
state p BAQ (defined by (|46j) yields 

P%®pt Q = Ç£pl\x){x\ B )®Pn Q , (49) 

X 

where p A ® := J2 x PnPn%- The difference operator n„( 7 ) 
appearing in (147 \ is given by 

n„( 7 ) - YiïWW 3 ® - ^Pn Q )- (50) 

X 

'According to the terminology of [10] it is the density matrix which one 
can associate to a c-q resource given by the ensemble {p^, pn x}- 
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Note that 



The average probability of error can thus be expressed as 



Tr 
Tr 



{n„( 7 ) > o}n„(7) 



p e (c (n) ) - 1 



{n„( 7 )>o}( pI\x){x\ b ® ( P 



AQ 
n.x 



1 - 



(51) 



Un 

^Tr[i^A^)] 

Un 

E Tr t« 

i=l 



(56) 



Hence, S_(B : AQ) is equivalently given by 
su P { 7 : Hm 5>*Tr[{p^ > p AQ } (p AQ x - 

This implies that for any 7 < : AQ), 

lim V^Tr[{p^< e «7 p AQ }í) AQ] : 



where ^« = (#®A«)rtí«. 
From Lemma[2it then follows that 



)]-!}■ 



P(C 



(52) 



For M n — \e nR ~\, Lemma |4] ensures the existence of a 
sequence of codes {C ( ™· ) }^L 1 of size \C^ n '\ = \e nR ~\, such 
that for each n 



Pe(C 



(n)\ 



< 



KTr[{p 



AQ 



(53) 



for any 7 £ R and c > 0. From (I52> it follows that for any 
7 < S(B : AQ), the first term on the RHS of (|53j vanishes 
in the limit n — ► 00. For all <5 > 0, there exists no <E N, such 
that for all n > n , \e nR ~\ < e n ^ R+s \ Hence, 

4e~" 7 \e nR ~\ < 4~"(' f ~( íï +' 5 )) j 

which vanishes as n — > 00 for 7 > i? + 5. Since <5 is arbitrary, 
it follows that any rate R < j < S_(B : AQ) is achievable. 
More generally, any rate < R < S_(B : AQ) is achievable. 

To prové the (weak) converse we are required only to show 
that for any code with rate larger than the capacity, there 
exists a probability distribution on the codewords such that the 
average probability of error does not vanish asymptotically. 

Define a family of codes of size M n by the average state of 
the codewords p®. Note that the family includes all possible 
sets of M n codewords with the same average state. Given 
the family {M n , p®}^ D =1 we can extend to any separable 
state p At 3 on an enlarged Hilbert space. The outcome of any 
measurement on A is thus classically correlated with a state 
on Q. 

Explicitly, we can assign the message that has been sent 
with the outcome of the set of measurements on A, described 
by a POVM {E^}, such that message i G {1,2, ...,M n } is 
generated with probability 



Pi = Tr[(E^^lQ)piQ 
and results in the codeword 

= Tr A 



Pn 



Eí 



In Pn® 



Eí 



(54) 



(55) 



which is then sent throught the noisy channel. 



>i-Tr[{n„( 7 )>o}n„( 7 )] 

i=l 

= i-Tr[{n„( 7 )>o}n„( 7 )] 

Pi^[E%PÏ Q ) 



P AQ ] 



M„ 



1=1 



(57) 



with n n ( 7 ) = Pn A ® ' — e ni p A ® Pn® '> an ^ where the probability 
Pi is given by ( |5H . 

Choosing only those POVMs such that 

1 



Pi 



(58) 



is sufficient to show that any code of size M n is not reliable. 
In this case 



Pe(C 



> 1 - Tr [{n„ (7) > 0}n„ ( 7 )] - — (59) 



and for any 8 > 0, choose M n = \e nR ~\ where R = S_(A : 
AQ) + 2(5, and 7 = S(A : AQ) + 6. Thus, the third term on 
the RHS of (1591 vanishes in the limit n — > 00. However, the 
difference of the first two terms does not vanish and we have 
limsup IWOO P e (£(")) > eo for some e > 0. 

We thus conclude that the classical capacity of a sequence 
of channels A = {A^}^^ is given by 



C(A) = max S(B : AQ) 



(60) 



where Q denotes the set of sequences of classical-quantum 
states in B(Hbq), with Tíbq being a sequence of Hilbert 
spaces Hbq '■= {Hg' (8> K.q''* }™ =1 , The monotonicity of the 
inf-spectral mutual information rate under CPTP maps (see 
[1]) implies that S(B : AQ) = S(AA' : AQ) > S(A : AQ). 
This ensures that optimization over classical-quantum states is 
equivalent to optimization over separable states, thus yielding 
the state ment of Theorem |^ 

V. Dense Coding 

Dense coding is the protocol by which prior shared entan- 
glement between a sender (Alice) and a receiver (BOB) is 
exploited for sending classical messages through a noiseless 



quantum channel. Let p AB £ Ti 



/(») 



(n) 

7í B be an entangled 



mixed state that Alice and Bob initially share. As in Section 
IIVI Alice has a set of messages, labelled by the elements 
of the set Ai„ = {1, 2, . . . , M n }, which she wishes to 
communicate to Bob. However, the quantum channel that she 
uses is noiseless. She encodes her messages into her part, A, 
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of the bipartite system AB which is in the state p AB ■ The 
codewords are given by 



B\ AB 



for i = A4 n . Here <p n denotes the encoding map for a code of 
size M n as defined in terms of the CPTP maps £ A i7 i £ M n - 
Let Bob's measurement on the states p AB that he receives, 
be given by E AB = {E AB }^, with each E A f > and 
Sf=i Ettf — ■ The average probability of error of the 
code C(™)' = (M„, (j> A ,E AB ) is given by 



P e (C 



1 



M„ 
1=1 



n.i n.i 



» 



(61) 



The dense coding capacity for a sequence of bipartite states 
p AB = {p AB }™ =1 is defined as 

C DC ■= swpR, (62) 

where R is an achievable rate. 

Theorem 5: The dense coding capacity for a sequence of 
bipartite states p AB = {p AB }n°=i is given by 



C DC = \ogd- min S(AA\B) 



(63) 



where A = {A A }^ =1 is a sequence of CPTP maps on A. 
Proof: [Converse] For a code of M n codewords 
= {cf>ni ® and measurement operators E A f, 

i = 1, . . . , M n , the average probability of error (16 11 satisfies 



rn.t 



i 



-Ti-7 



> 1 



max 



Tr[n n!Í ( 7 )] 



where 11^(7) = {p AB > e^I A 



e n(logrf — i) 



(64) 



In the above we have used Lemma[2and the facts that 
ÍZ,E AB < I AB and Tr I A = e nlo ^ d . 

If we then assume that M„ > e nR = logd — 
minA S(AA\B) + 25 for some 5 > 0, then we can choose 
7 = min^ S((f>A\B) — 5, and we find 

limsupP e (C (n) ) > e > (65) 



where 



Y p x p ab 

/ j fnrn,x 



Choose X( n > to be a set of size N n — d 2n and define 
a probability distribution {p%} on it, with p ^ — l/N n — 



-2n log d 



sd for each x £ X( n >. Further, consider states p AB 



defined as follows: 



(U A x A A ®ïd B ))p AB 



Here denote quantum operations for which the sequence 
{A A }n=i minimizes S(AA\B), and (ii) U A X , x G X^ n \ 
denotes unitary encodings with the shift-multiply operators 
t/( p>g ), with p,q G {0, 1, . . . , (d™ — 1)}, which are defined 
as follows ( [7], [12]): 

U(p, q )\j) = e~ \j +q(moàd)), 

with {\j) : j G {0, 1, . . . , (d n — 1)} being an orthonormal 
basis in a cP-dimensional Hilbert space. 
Let 

„AAB í\A^.- A B\AB 
Pn ■= ( A n ® ld )Pn ■ 

For the ensemble {p^, P AB } 

Ex AB x(uA „;íB\ A 

PnPn,x = 2^ íM"ri,z® ld )Pn 

xex("i xexw 
I A 



AAB 



d" 



Pn, 



(67) 



where p B is the reduced density matrix of the state p^ AB ■ 
For the ensemble {p n ,p AB } defined above, let 

Pn^[{p AB >e^p AB }p AB ] 



E 

xex( n ) 



We have that 



«n>| E Tr[{p AB >e^p AB } 

= Tr[{ P r B >e- n(logrf -^®P r f} 

x^-e-^^I;*®^)]. (68) 

In the above we have made use of the fact that the trace 
remains invariant under a unitary transformation. If 7 = 
log d-S(AA\B)-6 for any S > 0, the RHS of Jg8j goes to one 
as n ~ * 00. Hence the RHS of i66\ vanishes asymptotically, 
implying that a rate R = log d — minA S(AA\B) — 6 is 
achievable for any 6 > 0. ■ 



implying Cuc < logrf — min^ ■ 

Proof: [Coding] Lemma|4] adapted to the case of dense 
coding, states that for any n G N, M £ N, and 7 G R, given 
a probability distribution {p n } on X^ n \ where X^ is a finite 
set of indices, there exists a code C^ n ' of size 
whose average probability of error satisfies the bound 



Pe(C (n) ) < 2 



E 



^Tr[{<f < e"^ B }<f ] 



+4e-" 7 M, 



(66) 



A. Reduction to the i. i. d. Case 

For entanglement resources which are tensor products of 
identical bipartite states = p®^, with qab £ B(7í), the 
dense coding capacity was shown in [11] to be given by 



C DC = log d + 5(5) - inf inf ^5((A^ } ® id^ ' Je2i 

(69) 

Here S^S) = S(qb), where qb is the reduced density matrix 
of the system B, corresponding to the state Qab- 
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For sequences of i. i. d. states u> = {i?®™}^! and a = 
{s® n }n=v Theorem 4 of [5] states that 

D(w\\a) = £>(<%) =D(u>\\a). (70) 

For bipartite states i3 — Sab and ç = \Ia ® $b, ( 17 0> implies 
that 

S(A\B) = S(A\B) = S(A\B), (71) 

where S(A\B) = S(iï AB ) - This is because logd- 

S(A\B) = D(u\\a) = D(ú\\ç) = logrf - M^abWT) = 
logcí — and similarly for D(uj\\a). If instead, we 

choose Aab and çab to be states in B(Ti® N ), given by 

and 

ÍAB — ^T-ÍA ® QB i 

then ( I70> yields the identity 

S(AWaIB) = Í5((A^ ® id^)^) - (72) 

Hence, in this case our expression (I63> for the dense coding 
capacity reduces to d69l . 
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